What’s this?

This is a section of a paper I am slowly working on. I have extracted it so that I can share it and think through with people. The background of this is that I am trying to use rejection sampling alone to condition ABMs when I have little data. I have used rejection sampling tons in Indonesia but precisely because there is little data it’s hard then to validate the tool.

So what I did is taking a data-rich set of papers (Makino et al. 2017; Watari et al. 2017) on the タチウオ fishery in the Bungo channel and tried to re-create it by pretending to know less than what the original papers knew.
The application is actually pretty cool as it specifies quite nicely the economic constraints of the japanese boats which makes it perfect for our agent-based model.

In the end the focus of the paper and this section is more on value of evidence using models instead of our usual bio-economic policy analysis.

Description

In this example we model the Japanese coastal trolling fishery based in Usuki as described in Makino et al. (2017). As of 2011 it was composed of 45 boats, all smaller than 5 gross tons, targeting hairtail (Trichiurus japonicus). Sale price of hairtail depends on its size and letting it grow for 6 months can triple its value. The contribution of the Usuki trolling fishery to the overall fishing mortality is however limited as trolling boats share the stock with purse seiners who target other species and land hairtail as a byproduct. The fishery is not data-limited as Watari et al. (2017) presents a full stock assessment. The purpose of this example is one of data-degradation: what changes when we pretend to have less data than what is actually available.

Makino et al. (2017) focuses on the economic side of the fishery: number of fishers is in decline, most of them earn about 5-6M JPY which covers fixed and living costs but is not enough to save the 25M JPY needed to build a new boat. The focus on individual boats makes it ideal for an agent-based model analysis and here we create a simple POSEIDON (Bailey et al. 2018) application for it.

We sketch here the key ideas of the model but leave the full details in the appendix. The biological operating model is length-based, with 5cm length-bins using the fixed boxcar method (Goudriaan 1986). Recruitment is noisy and happens twice a year (spring and autumn) assuming a Beverton-Holt stock-recruitment relationship. Geographically fish is spread out over a 5-by-5 grid, each cell a square 12km wide. We model each trolling boat separately. Boats can fish any day of the year and choose the fishing location by trial and error through the explore-exploit-imitate algorithm (Carrella, Bailey, and Madsen 2019).
We track the profits made by each boat. Those that manage to accumulate more cash than their target savings (after accounting for living and fixed costs) spawn a successor (that is, create a new boat that joins the fishery). Those that fail to cover their living costs will eventually leave.
The purse seiner fleet is not agentized and is instead a simple constant fishing mortality \(F\) applied to the stock at the end of each year.

We run the model for at most 45 years, starting the trolling fishery ab novo at year 1. We condition the model on only six pieces of information. First, total landings (trolling and purse seiners combined) in the fishery have never been above 15,000t. This is evident from figure 2 in Watari et al. (2017). Second, current landings of the Usuki trolling fleet is between 250t and 1,850t. The lower bound is used as an historical target in Makino et al. (2017), the upper bound is the non-net fishing mortality computed for 2011 in Watari et al. (2017). The upper bound is probably too high as it includes other sources of fishing mortality. Third, current SPR is unsustainable but not catastrophically so and we will accept fisheries whose current SPR is between .10 and .25. Watari et al. (2017) cites an SPR for the fishery of .21 in 2011. Fourth, the Usuki trolling fishery today has less than 60 active boats. Fifth, the Usuki trolling fishery represents at most 30% of the total landings for the stock. Sixth, the fishery is at least 30 years old.

Because our model is length rather than age based, we compute SPR via a length-based approximation, assuming median life-history parameters for the species (using Fishlife from Thorson et al. 2017; rFishBase from Boettiger, Lang, and Wainwright 2012). In other words, when computing SPR we do not assume we know the correct life-history parameters within the simulation and as such we expect this filter to be quite noisy.

This POSEIDON application is simple and lacks many details, inter alia it assumes knife-edge maturity, a simple geography, no seasonal variation in ex-vessel prices and simple stochasticity for the bi-annual spawning pulses. In spite of its simplicity it still contains too many parameters (see table ??) to hope for identification given the little data we condition with.

Table 1: A table with all unknown parameters from the POSEIDON simulation we need to randomize. Some (like cost data) have narrow priors but others (like catchability and steepness) are wide
Variable Distribution Meaning Source
catchability \(U[10^{-5},10^{-9}]\) % of biomass caught per hour spent fishing -
\(S_1\) \(U6.2,9.3]\) Logistic selectivity, first parameter (size limit at 25cm)
\(S_2\) \(U[0.17,0.26]\) Logistic selectivity, second parameter (size limit at 25cm)
Cell with most biomass \(U[1,5] \times U [1,5]\) Cell in the 5x5 grid with the most biomass -
Biomass smoothing \(U[0.1,1]\) Parameter defining the geographical dispersion of biomass -
Virgin recruits \(R_0\) \(U[37M,62M]\) Number of annual recruits at virgin biomass \(R_0\) that would generate 10,000t to 200,000t of virgin biomass
\(\Phi\) \(U[0.27,0.31]\) Ratio \(R_0\) to spawning stock biomass Implied by knife-edge maturity from Fishlife median \(L_{\text{mat}}\)
Steepness \(U[0.2,0.99]\) Proportion of current recruits to virgin when 20% of biomass left -
Initial depletion \(U[0.7,1]\) Proportion of biomass to virgin available as simulation starts -
Life history parameters \(L_{\infty},K,L_{\text{mat}},M\) Drawn from Fishlife Parameters governing the speed of growth for all fish Drawn from Fishlife multi-normal distribution
Allometric parameters \(a,b\) Bootstrap sampled from rFishbase Parameters converting fish length to weight Sampled with replacement from Fishbase
Hourly variable cost (JPY/hr) \(U[100,140]\) Monetary cost per hour spent at sea (fuel,engine) Makino et al. (2017)
Hourly effort cost (JPY/hr) \(U[700,800]\) Monetary cost per hour spent fishing (film, bait and boxing) Makino et al. (2017)
Hold size (kg) \(U[1000,3000]\) Maximum amount of fish transportable by a boat 5GT boats
Savings target (JPY) \(U[20M,30M]\) Money to accumulate to spawn additional boat Makino et al. (2017)
Yearly Expenses \(U[4.5M,7M]\) Yearly expenditure (living costs, plus fixed costs) Makino et al. (2017) and census
Daily probability of fishing \(U[.75,.9]\) Probability of going fishing each day of the year Comparing boat numbers with total effort recorded
Yearly exogenous fishing mortality \(U[.3,.8]\) Mortality rate due to other fishers targeting the stock -

Value of evidence

We run the model 354,775 times and accept only 743 fisheries. The extremely low acceptance rate (0.2%) is explained by the unnecessarily wide priors, particularly in catchability, steepness and exogenous fishing mortality. Many draws from the prior are fisheries that are not economically viable or biologically sustainable for more than a few years. In terms of marginal distributions most posteriors have not shifted from their priors but many of the parameters that were independent are now correlated.

The joint distribution of \(M\) (natural mortality) and \(K\) (growth coefficient) is particularly affected by rejection sampling. The prior of \(\frac{M}{K}\), which we inherit from Fishlife, is bell-shaped and centered around 1.3. The posterior is still bell-shaped but shifted to the left (median of 1.09).

We show in figure 1 how evidence would shift this posterior further. In subfigure B we draw the two hypothetical posteriors had we accepted SPR above .3 or below .1 instead of the .1-.25 interval we are actually using. Assuming high SPR moves the \(\frac{M}{K}\) posterior left, while low SPR moves posterior right and makes it bimodal. This in essence means that low SPR could be a symptom of low \(\frac{M}{K}\) (a fish that naturally grows slower and dies quicker) rather than any true signal of stock health. Sensitivity of length based SPR to mispecifications of \(\frac{M}{K}\) is well understood (Hordyk et al. 2014, 2019; Froese et al. 2019b, 2019a) and here we are simply picking up the conflict between Fishlife’s wide priors and using its modal values to compute SPR. In other words, on its own, length-based SPR could be either a sign of overfishing or a sign of mispecified life history parameters.

Ratio between natural mortality M and growth rate K; subfigure A shows prior and posterior distribution of the $\frac{M}{K}$, the prior is drawn from FishLife and the posterior represents the fisheries that passed all filters, the posterior has shifted mostly to the left ;subfigure B compares the $\frac{M}{K}$ posterior had we only SPR above 30 (cyan) or below 10 (green); subfigure C shows the posterior achieved by accepting only fisheries who have more than 60 boats active today (orange).

Figure 1: Ratio between natural mortality M and growth rate K; subfigure A shows prior and posterior distribution of the \(\frac{M}{K}\), the prior is drawn from FishLife and the posterior represents the fisheries that passed all filters, the posterior has shifted mostly to the left ;subfigure B compares the \(\frac{M}{K}\) posterior had we only SPR above 30 (cyan) or below 10 (green); subfigure C shows the posterior achieved by accepting only fisheries who have more than 60 boats active today (orange).

A far more counterintuitive effect of evidence is the limit on the number of boats (no more than 60 boats in Usuki today) as shown in figure 1, subfigure C. If we accept only runs with more than 60 boats, the \(\frac{M}{K}\) posterior is bi-modal and shifts further to the right: we expect a higher \(\frac{M}{K}\). This is counter-intuitive because higher \(\frac{M}{K}\) means that the fish grows slower and is less likely to get to maturity. We would expect this stock to be able to sustain only a smaller fleet.

What we uncovered is a collider bias (a Monty Hall effect as in Burns and Wieth 2004). It is true that higher \(\frac{M}{K}\) implies a more vulnerable fishery: the higher \(\frac{M}{K}\) in the prior the fewer years it takes for a random fishery to collapse. High \(\frac{M}{K}\) also implies less economic viability: larger fish is paid more and the higher \(\frac{M}{K}\) gets the less likely the fish is to get large enough to command a price premium. This however is a priori knowledge: a posteriori we are not looking at the whole space of fisheries but only the specific space of fisheries that survived until today (at least \(250t\) landed in Usuki this year) and still representing only a portion of total landings.

Fisheries with high \(\frac{M}{K}\) that survive are quantity-driven rather than value-driven. If fish matures slowly and is less likely to survive, boats need to catch it when it is shorter. If short fish sells for less, boats need to catch more of it per unit of effort. Knowing both that \(\frac{M}{K}\) is high and that the fishery survived implies that other model parameters must change to make it so. This is done mostly by exploiting the wide priors on weight-length parameters \(a,b\). In the prior there is no correlation between \(\frac{M}{K}\) and weight-length parameters, in the posterior the correlation is 0.88: in an accepted fishery with \(\frac{M}{K}=1.2\) a 50cm fish weighs 200g, but at \(\frac{M}{K}=2\) it has to weigh 600g. The reverse wouldn’t work: high \(\frac{M}{K}\) with low weight would be economically unviable, low \(\frac{M}{K}\) with high weight would exceed landings limits (either in absolute tonnage or relative to the purse seine fleet). Therefore, high \(\frac{M}{K}\) in the posterior implies a quantity-driven fishery with a high number of boats.

This collider effect can work to our advantage. Even with data, \(\frac{M}{K}\) and in particular natural mortality \(M\) are hard to estimate and drive many stock assessment results (Mangel et al. 2013). Here however we have connected the ratio to weight-length parameters which can be estimated cheaply. The heavier the fish, the higher our expected \(\frac{M}{K}\) becomes.

The same collider effect explains an even more puzzling prediction: runs with lower SPR will “do better” (attract more boats) in the future, as shown in figure 2. As explained above, low SPR is associated with high \(\frac{M}{K}\) and high \(\frac{M}{K}\) is associated with quantity-driven fisheries with many boats. High participation fisheries are usually filtered away because of our 60 boats limit but participation today may be temporarily low due to recent recruitment failures (critically 10 to 5 years ago). In context then, a low SPR may indicate a high participation fishery that is currently under-strength. This kind of fishery will eventually snap back to its long-term high participation equilibrium.

Projected and past realizations of participation and recruitment for accepted fisheries as well as fisheries that would have been accepted had SPR evidence been different. Each line is a separate simulation, bolded lines are median values. The subset of runs with low SPR and also M/K ratio above two have been grouped separately (in black). Low SPR today implies higher participation tomorrow if associated with higher M/K. This is better explained by looking at recruitment (subfigure B): these fisheries in equilibrium have higher participation but are over-correcting to low recruitment pulses 5 to 10 years ago and are likely to rebound.

Figure 2: Projected and past realizations of participation and recruitment for accepted fisheries as well as fisheries that would have been accepted had SPR evidence been different. Each line is a separate simulation, bolded lines are median values. The subset of runs with low SPR and also M/K ratio above two have been grouped separately (in black). Low SPR today implies higher participation tomorrow if associated with higher M/K. This is better explained by looking at recruitment (subfigure B): these fisheries in equilibrium have higher participation but are over-correcting to low recruitment pulses 5 to 10 years ago and are likely to rebound.

We can generate another counter-intuitive result by focusing on evidence we did not use. Makino et al. (2017) mentions how the number of Usuki fishers have decreased by 40% in the last 10 years. We did not use this figure because it is the total loss across all fisheries, not hairtail specific. In this paragraph however, we add this as another piece of evidence. Again rejection sampling helps us contextualize this piece of information: a large drop in boats may mean an overfished stock, a series of weak recruitments, or a too aggressive undershooting in participation that will self-correct as profits return. In figure 3 we show future participation splitting the accepted fisheries between those that suffered at least a 40% boat reduction in the last ten years and the fisheries that didn’t. Within 15 years there is no real difference in participation and in fact fisheries that declined rapidly tend to see higher participation in the short run.

Projected fishery participation for the next 15 years. Each line is a separate accepted fishery, bolded lines are yearly medians. Accepted fisheries were split into two groups: those that saw at least a 40% decline in boats within the last ten years and those that didn't. Large participation decline is not a factor predicting further drops in the long run and to the contrary may mean larger participation in the short run.

Figure 3: Projected fishery participation for the next 15 years. Each line is a separate accepted fishery, bolded lines are yearly medians. Accepted fisheries were split into two groups: those that saw at least a 40% decline in boats within the last ten years and those that didn’t. Large participation decline is not a factor predicting further drops in the long run and to the contrary may mean larger participation in the short run.

Policy analysis

For the accepted fisheries, business as usual will not result in any long-term change in the number of fishers or any other important indicator. This we show in figure 4. It remains however a depleted fishery with biomass at approximately 15% of carrying capacity, justifying additional policies.

Projected participation and biomass for the accepted fisheries when no policy is implemented. Each line is a simulation,the bolded line is the median yearly value. While each individual run will exhibit noise and short term trends, the overall dynamic is one of long-term equilibrium around current values.

Figure 4: Projected participation and biomass for the accepted fisheries when no policy is implemented. Each line is a simulation,the bolded line is the median yearly value. While each individual run will exhibit noise and short term trends, the overall dynamic is one of long-term equilibrium around current values.

We assume here that we can only enforce policies on the Usuki trolling fleet and cannot affect fishing mortality outside of it. Because we constrained the Usuki fishery to represent at most 30% of the landings, we cannot expect regulations to achieve much rebuilding.

We can still make a difference for the economic health of this fleet in two ways. First, we could try marginally improving biomass in order to increase CPUE and therefore profitability. Second, we could try changing selectivity to increase the average size of fish caught and therefore earning more money for each ton of fish landed.

We compare three policies. The first is just closing down the fishery to new entrants and re-entry. The second combines the first policy with imposing a limit of 200 days a year for fishing. The third is to combine the first policy with mandating a new gear that catches larger fish (specifically, increases by 10% \(S_1\) and decreases by 10% \(S_2\) of the logistic selectivity).

While seasonal closures do achieve higher profits per trip and higher selectivity increases revenue per unit caught, these benefits are negated by the purse seine non-Usuki fishery which increases its relative importance in landings. Seasonal closures generate higher profits per trip but these fail to cover fixed costs. If the overall objective is either increased profits or higher biomass, banning entry/re-entry alone performs just as well as more complex policies. We show these results in figure 5.

Key indicators projections for three policies (no entry or re-entry, 200 days fishing limit, higher selectivity); each line is a separate fishery, bolded lines are the median yearly value. Better selectivity increases revenue per kg caught (subfigure D) and fishing limit increases profits per trip (subfigure B). In general however all policies cause more landings to occur outside the Usuki trolling fishery (subfigure E) and selectivity improvements do not improve overall savings any better than just closing to new entrants (subfigure  F)

Figure 5: Key indicators projections for three policies (no entry or re-entry, 200 days fishing limit, higher selectivity); each line is a separate fishery, bolded lines are the median yearly value. Better selectivity increases revenue per kg caught (subfigure D) and fishing limit increases profits per trip (subfigure B). In general however all policies cause more landings to occur outside the Usuki trolling fishery (subfigure E) and selectivity improvements do not improve overall savings any better than just closing to new entrants (subfigure F)

Comparison to data-rich results

Biomass in the stock assessment produced in Watari et al. (2017) is estimated at 4,896t in 2011. Here the mean current biomass is 3,477t with standard deviation of 1,806t. We are about one standard deviation away using only limited evidence.

The main difference between our work and the data-rich simulations in Makino et al. (2017) is that our results are far more optimistic in the status quo. In the original data-rich work the unmanaged fishery constantly deteriorates, both biologically and economically. Our projections instead show a long term equilibrium around current values.

The discrepancy can be explained by the underlying differences in the economic model. In our paper boats quit when earnings dry up and bank balances turn negative. This helps the fishery recover over time and avoid collapses. In Makino et al. (2017) however, fishing mortality is a free parameter and as such fishing pressure can continue unabated.

One contributing factor for discrepancies is also our choice for life-history parameters. We used “off-the-shelf” FishLife distributions as priors. These are wide and not necessarily centered near the real values from the stock assessment. For example, in its stock assessment Watari et al. (2017) assumes maximum age for hairtail at 5 (the maximum ever observed in the area). Using Quinn and Deriso (1999) thumb rule (Hordyk et al. 2014) this would imply a natural mortality of at least 0.9. Fishlife average is 0.39 with a standard deviation of 0.11.

References

Bailey, Richard M., Ernesto Carrella, Robert Axtell, Matthew G. Burgess, Reniel B. Cabral, Michael Drexler, Chris Dorsett, Jens Koed Madsen, Andreas Merkl, and Steven Saul. 2018. “A computational approach to managing coupled human–environmental systems: the POSEIDON model of ocean fisheries.” Sustainability Science, June, 1–17. https://doi.org/10.1007/s11625-018-0579-9.

Boettiger, C., D. T. Lang, and P. C. Wainwright. 2012. “Rfishbase: Exploring, manipulating and visualizing FishBase data from R.” Journal of Fish Biology 81 (6): 2030–9. https://doi.org/10.1111/j.1095-8649.2012.03464.x.

Burns, Bruce D., and Mareike Wieth. 2004. “The collider principle in causal reasoning: Why the Monty Hall dilemma is so hard.” Journal of Experimental Psychology: General 133 (3): 434–49. https://doi.org/10.1037/0096-3445.133.3.434.

Carrella, Ernesto, Richard M. Bailey, and Jens Koed Madsen. 2019. “Repeated discrete choices in geographical agent based models with an application to fisheries.” Environmental Modelling and Software 111 (January): 204–30. https://doi.org/10.1016/j.envsoft.2018.08.023.

Froese, Rainer, Henning Winker, Gianpaolo Coro, Nazli Demirel, Athanassios C Tsikliras, Donna Dimarchopoulou, Giuseppe Scarcella, Wolfgang Nikolaus Probst, Manuel Dureuil, and Daniel Pauly. 2019a. “On the pile-up effect and priors for Linf and M/K: response to a comment by Hordyk et al. on ‘A new approach for estimating stock status from length frequency data’.” ICES Journal of Marine Science 76 (2): 461–65. https://doi.org/10.1093/icesjms/fsy199.

———. 2019b. “A new approach for estimating stock status from length frequency data.” ICES Journal of Marine Science 76 (1): 350–51. https://doi.org/10.1093/icesjms/fsy139.

Goudriaan, J. 1986. “Boxcartrain Methods for Modelling of Ageing, Development, Delays and Dispersion.” In The Dynamics of Physiologically Structured Populations, edited by J. A. J. Metz and O. Dierkmann, 1st ed., 453–73. Berlin: Springer.

Hordyk, Adrian, Kotaro Ono, Keith Sainsbury, Neil Loneragan, and Jeremy Prince. 2014. “Some explorations of the life history ratios to describe length composition, spawning-per-recruit, and the spawning potential ratio.” In ICES Journal of Marine Science, 72:204–16. 1. https://doi.org/10.1093/icesjms/fst235.

Hordyk, Adrian R, Jeremy D Prince, Thomas R Carruthers, and Carl J Walters. 2019. “Comment on "a new approach for estimating stock status from length frequency data" by Froese et al. (2018).” ICES Journal of Marine Science 76 (2): 457–60. https://doi.org/10.1093/icesjms/fsy168.

Makino, Mitsutaku, Shingo Watari, Taro Hirose, Kentaro Oda, Masahito Hirota, Atsushi Takei, Michio Ogawa, and Hiroshi Horikawa. 2017. “A transdisciplinary research of coastal fisheries co-management: the case of the hairtail Trichiurus japonicus trolling line fishery around the Bungo Channel, Japan.” Fisheries Science 83 (6): 853–64. https://doi.org/10.1007/s12562-017-1141-x.

Mangel, Marc, Alec D. MacCall, Jon Brodziak, E. J. Dick, Robyn E. Forrest, Roxanna Pourzand, and Stephen Ralston. 2013. “A perspective on steepness, reference points, and stock assessment.” Edited by Kenneth Rose. Canadian Journal of Fisheries and Aquatic Sciences 70 (6): 930–40. https://doi.org/10.1139/cjfas-2012-0372.

Quinn, Terrance J., and Richard B. Deriso. 1999. Quantitative Fish Dynamics. Oxford University Press.

Thorson, James T., Stephan B. Munch, Jason M. Cope, and Jin Gao. 2017. “Predicting life history parameters for all fishes worldwide.” 8. Vol. 27. https://doi.org/10.1002/eap.1606.

Watari, Shingo, Syunji Tokumitsu, Taro Hirose, Michio Ogawa, and Mitsutaku Makino. 2017. “Stock structure and resource management of hairtail Trichiurus japonicus based on seasonal broods around the Bungo Channel, Japan.” Fisheries Science 83 (6): 865–78. https://doi.org/10.1007/s12562-017-1140-y.